Indices options group wildcards #7

gmarouli · 2024-01-26T19:49:08Z

This PR is a draft alternative of elastic#103518 given that this refactoring is accepted.

We believe that this refactoring, makes the option to add the failure store to the current IndicesOptions a possibility.

The new builders allow us to (temporarily) rebuild IndicesOptions with the failure store without having to extend the boolean factory methods.
Using the builders means that the default option of not including the failure store indices is applied everywhere.
It will make adding this to search easier because there are a lot of request rewrites that we need to wire the DataStreamOptions while like this we can rely on the existing infrastructure.

I am starting to see potential here.

…ed (will move them to new class).

) The stream + collect operation on empty lists was causing 6% of all allocations during document parsing. Lets have 0-alloc methods to check for this dynamic fields before we do anything about them.

Even with Kahan summation, there is a tiny precisoin loss. We already fixed this before for GeoPoint by using the encode/decode to doc-values quantization in the final display results, and for CartesianPoint we fix by casting the coordinates to float for both the expected and actual results.

Using the Collections wrapper is less optimal than `new ConcurrentHashMap<V, Boolean>().keySet(Boolean.TRUE);` solution. With the collection wrapper, mod counts on the CHM are updated on NOOP add calls. For contested concurrent noop updated this solution was benchmarked to be more than 2x as fast and should outperform the Collections wrapper in essentially all cases (not just for the mod count update reasons, it also has fewer virtual calls).

…#104714) Fixes: elastic#104708

Moves some of the detail about S3 storage classes to their own section for easier linking, and adds a note about `intelligent_tiering` archive classes.

Misc tidy-up following elastic#104394: - This action only runs on the coordinating node, no need to define wire serialization for its request/response types. - No need to subclass `ActionType`, nor to define how to receive responses from remote clusters. - Moves to executing an `AbstractRunnable` to be sure to handle all failures (including threadpool rejections) properly.

elastic#104722) When advancing a datafeed's search interval past a period with no data, always advance by at least one time chunk. This avoids a problem where the simple aggregation used to advance time might think there is data while the datafeed's own aggregation has filtered it all out. Prior to this change, this could cause the datafeed to go into an infinite loop. After this change the worst that can happen is that we step slowly through a period where filtering inside the datafeed's aggregation is causing empty buckets. Fixes elastic#104699

* Starting cohere * Making progress on cohere * Filling out the embedding types * Working cohere * Fixing tests * Removing rate limit error message * Fixing a few comments * Update docs/changelog/104559.yaml * Addressing most feedback * Using separate named writeables for byte results and floats * Fixing a few comments and adding tests * Fixing mutation issue * Removing cohere service settings from named writeable registry

…ameters (elastic#104718)

Lower the upper bound of large response size from 2 times of suggestedMaxAllocationSize to 1.5 so that it be sent over with 3 messages. This is because the split threshold is 0.99 of suggestedMaxAllocationSize. Resolves: elastic#104728

…ards (elastic#104709) Remove +1L that allows the third-smallest shard to be also allocated on the node in case it is only 1b bigger than second-smallest

`ActionType` represents an action which runs on the local node, there's no need for implementations to define a `Reader<Response>`. This commit removes the unused constructor argument.

…#104741)

* Avoid eager task realization in esql qa projects * Fix eager task realization in PomValidationPrecommitPlugin * Make loadCsvSpecData task lazy created * Fix test task reference

… GET/POST requests (elastic#103683)" (elastic#104760) This reverts commit b4345d9.

…elastic#104725) Recently a user saw spurious delayed data warnings. These turned out to be due to accidentally setting `summary_count_field` to a field that was always zero. This meant that every document was considered delayed.

Co-authored-by: Elastic Machine <[email protected]>

…04623) * Functions E-Z * Incorporate changes from elastic#103686 * More functions * More functions * Update docs/reference/esql/functions/floor.asciidoc Co-authored-by: Liam Thompson <[email protected]> * Update docs/reference/esql/functions/left.asciidoc Co-authored-by: Liam Thompson <[email protected]> * Apply suggestions from code review Co-authored-by: Alexander Spies <[email protected]> * Review feedback * Fix geo_shape description * Change 'colum'/'field' into 'expressions' * Review feedback * One more --------- Co-authored-by: Liam Thompson <[email protected]> Co-authored-by: Alexander Spies <[email protected]> Co-authored-by: Elastic Machine <[email protected]>

A recent report shows that we can perform ESQL planning on the refresh thread pool after waiting for refreshes from search-idle shards. While the planning process is generally lightweight, it may become expensive at times. Therefore, we should fork off the refresh thread pool immediately upon resuming ESQL execution. Another place where we should fork off is after field_caps. I will look into that later.

* Add CO2 data for AWS ap-southeast-3 and me-central-1 * Update CO2 data for GCP including new regions * Update CO2 data for Azure including new/changed regions * Move provider data into CloudProviders.java * Add NOTICE and LICENSE for the provider data * Document CloudProviders' public functions * Add cloudcarbonfootprint-(NOTICE|LICENSE).txt as ignoreFile --------- Co-authored-by: Elastic Machine <[email protected]>

Build these more lazily avoiding putting them in an array and don't keep an accidental reference to the aggregator itself.

On reflection is was probably a mistake to give each `ChunkedRestResponseBody` a nontrivial lifecycle in elastic#99871. The lifecycle really belongs to the whole containing `RestResponse`. This commit moves it there.

…lastic#104547) * Allow both string and datetime as the third and fourth inputs to auto_bucket Committer: Fang Xing <[email protected]> * Allow both string and datetime as the third and fourth inputs to auto_bucket * Allow both string and datetime as the third and fourth inputs to auto_bucket * Allow both string and datetime as the third and fourth inputs to auto_bucket * Allow both string and datetime as the third and fourth inputs to auto_bucket * Allow both string and datetime as the third and fourth inputs to auto_bucket

data_stream/190_require_data_stream/Testing require_data_stream in bulk requests Awaiting fix from elastic#104774

…ns older than 8.13.0 (elastic#104780)

…04734) Reverts elastic#104597 Reverting due to elastic#104732, will reinstate it when the bug is fixed.

This shouldn't actually change anything, as the format has not been modified recently. This simply marks 8500009 as used by 8.12.1 to help with patches on the 8.12 branch

github-actions · 2024-01-26T19:49:20Z

Documentation preview:

✨ Changed pages

…sts testFold {TestCase=<double> #7} elastic#114175

gmarouli and others added 30 commits January 23, 2024 11:28

Mark the enum options IGNORE_ALIASES and ALLOW_NO_INDICES as deprecat…

e35d783

…ed (will move them to new class).

Group all wildcard options in one class.

9bb5ade

Fix class method failure

00c6c5b

Fix XContent serialization

764e8f3

Support conflicting XContent serialization expectations

93527cb

Support conflicting XContent serialization expectations

bea09bf

Merge branch 'main' into indices-options-group-wildcards

65a7035

[Connectors API] Implement update service type action (elastic#104643)

d5c0dcf

Remove explicit ALLOW_NO_INDICES, it is on by default

4323ac3

adding known issue for int8_hnsw (elastic#104664)

ef630a6

Speedup check for dynamic mappers when parsing documents (elastic#104698

9d3207c

) The stream + collect operation on empty lists was causing 6% of all allocations during document parsing. Lets have 0-alloc methods to check for this dynamic fields before we do anything about them.

Update IndicesOptionsTests and fix the related found bugs

d9088fa

Rename expandWildcards field

6c86aed

Also allow test_grok_pattern's content passed by query param (elastic…

324d35f

…#104714) Fixes: elastic#104708

Improve comments

37c4393

Improve S3 storage class docs (elastic#104599)

30f9639

Moves some of the detail about S3 storage classes to their own section for easier linking, and adds a note about `intelligent_tiering` archive classes.

Fix SamlAuthenticationIT flakyness (elastic#103867)

ab8ee60

Fix serverless docker setup compatibility (elastic#104724)

6b910db

AwaitsFix elastic#104728

9b4647c

merge with main

90acd5d

ESQL: Fix replacement of nested expressions in aggs with multiple par…

d1e5f72

…ameters (elastic#104718)

Fix testRestoreSnapshotAllocationDoesNotExceedWatermarkWithMultipleSh…

a7f2e2d

…ards (elastic#104709) Remove +1L that allows the third-smallest shard to be also allocated on the node in case it is only 1b bigger than second-smallest

Remove unused arg from ActionType ctor (elastic#104650)

1116889

`ActionType` represents an action which runs on the local node, there's no need for implementations to define a `Reader<Response>`. This commit removes the unused constructor argument.

Reinstate compat shim lost in elastic#104650

55ba6fe

gmarouli and others added 27 commits January 25, 2024 15:07

Extract GeneralOptions

cb68e44

[DOCS] Fixes asciidoc syntax in PUT trained models API docs. (elastic…

e48b549

…#104741)

Avoid eager task realization (elastic#103343)

209b655

* Avoid eager task realization in esql qa projects * Fix eager task realization in PomValidationPrecommitPlugin * Make loadCsvSpecData task lazy created * Fix test task reference

Revert "[Enterprise Search] Add .connector-secrets system index and…

05ea8c7

… GET/POST requests (elastic#103683)" (elastic#104760) This reverts commit b4345d9.

Fix writable

599a96b

Closing services (elastic#104726)

05c7377

Co-authored-by: Elastic Machine <[email protected]>

Preserve old serialisation for performance reasons

ba579f0

Build sub aggregation buckets more lazily (elastic#104762)

fc2bdc2

Build these more lazily avoiding putting them in an array and don't keep an accidental reference to the aggregator itself.

Extract concrete index options

a22e332

Make RestResponse releasable (elastic#104752)

03c9f89

On reflection is was probably a mistake to give each `ChunkedRestResponseBody` a nontrivial lifecycle in elastic#99871. The lifecycle really belongs to the whole containing `RestResponse`. This commit moves it there.

Mute data_stream test (elastic#104775)

2a5cd78

data_stream/190_require_data_stream/Testing require_data_stream in bulk requests Awaiting fix from elastic#104774

ESQL: add =~ operator (case insensitive equality) (elastic#103656)

79b7dbb

Merge branch 'main' into indices-options-group-wildcards

6051c0d

Add skip tags to three tests to prevent them from being run on versio…

d876ec7

…ns older than 8.13.0 (elastic#104780)

Revert "x-pack/plugin/core: make automatic rollovers lazy" (elastic#1…

807147d

…04734) Reverts elastic#104597 Reverting due to elastic#104732, will reinstate it when the bug is fixed.

Use the randomly generated options

e495f7f

Redefine index version 8500009 for use by 8.12.1 (elastic#104755)

d128481

This shouldn't actually change anything, as the format has not been modified recently. This simply marks 8500009 as used by 8.12.1 to help with patches on the 8.12 branch

Merge branch 'main' into indices-options-group-wildcards

d5fd6ee

Replace constructor usage with builders

99a6f89

Reduce number of constants

922781c

Use builder

3d7b292

Remove Writable from grouped IndicesOptions

c8e4c91

gmarouli closed this Jan 26, 2024

elasticmachine pushed a commit that referenced this pull request Oct 7, 2024

Mute org.elasticsearch.xpack.esql.expression.function.aggregate.AvgTe…

0c69de1

…sts testFold {TestCase=<double> #7} elastic#114175

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Indices options group wildcards #7

Indices options group wildcards #7

gmarouli commented Jan 26, 2024

github-actions bot commented Jan 26, 2024

Indices options group wildcards #7

Indices options group wildcards #7

Conversation

gmarouli commented Jan 26, 2024

github-actions bot commented Jan 26, 2024